In [1]:
%autosave 10
map[int, float] is what I need? So wrap it in Cython!
fit, predict, transform, score, partial_fit
traits, pyre.
scipy.cluster.vq.kmeans is precise, slow.
sklearn.cluster.MiniBatchKMeans is statistical, much faster.
sklearn.random_projection (averages features)
sklearn.utils.extmath.randomized_svd
joblib.
IPython, multiprocessing, celery.
joblib.Memory, memoize pattern.
get, and even then only if you iterate over it.
hashlib.md5, robust, no dependencies.
np.save big numpy arrays.
zlib.compress faster, used again because no dependencies.
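The memoize pattern noted above, combined with hashlib.md5 for keys and zlib.compress for storage, can be sketched with the standard library alone. This is a minimal illustration, not how joblib.Memory is implemented: the names disk_memoize and slow_square, the ".pkl.z" file suffix, and the use of pickle for serialization are all assumptions made for the example.

```python
import functools
import hashlib
import os
import pickle
import tempfile
import zlib


def disk_memoize(cache_dir):
    """Hypothetical helper: cache results on disk, keyed by an MD5 of the arguments."""
    os.makedirs(cache_dir, exist_ok=True)

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            # Build the cache key from the function name and its pickled arguments.
            payload = pickle.dumps((func.__name__, args, sorted(kwargs.items())))
            key = hashlib.md5(payload).hexdigest()
            path = os.path.join(cache_dir, key + ".pkl.z")

            # Cache hit: decompress and unpickle the stored result.
            if os.path.exists(path):
                with open(path, "rb") as f:
                    return pickle.loads(zlib.decompress(f.read()))

            # Cache miss: compute, then store the compressed pickle.
            result = func(*args, **kwargs)
            with open(path, "wb") as f:
                f.write(zlib.compress(pickle.dumps(result)))
            return result

        return wrapper

    return decorator


calls = []  # records every real execution of the wrapped function


@disk_memoize(tempfile.mkdtemp())
def slow_square(x):
    calls.append(x)
    return x * x


first = slow_square(12)   # computed and written to disk
second = slow_square(12)  # read back from the cache
```

After the two calls, calls holds a single entry, because the second call is served from the compressed file on disk. Pickling the arguments to build the key only works for picklable inputs with a stable byte representation, so for large arrays a dedicated tool such as joblib.Memory is probably the better fit.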